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m ' ABSTRACT 

on : 

ON ■ 

To examine how peculiar velocities can affect the 2-, 3-, and 4-point redshift correlation functions, we 
. evaluate volume-average correlations for configurations that emphasize and minimize redshift distortions 
for four different volume-limited samples from each of the CfA, SSRS, and IRAS redshift catalogs. We 
\& ' present the results as the correlation length tq and power index 7 of the 2-point correlation, £2 = ( r o/ r ) 7 ! 
and as the hierarchical amplitudes of the 3- and 4-point functions, S3 = £3/^2 and 54 = £4/£ 2 3 - 

We find a characteristic distortion for £2'- the slope 7 is flatter and the correlation length is larger in 
redshift space than in real space; that is, redshift distortions "move" correlations from small to large scales. 
At the largest scales (up to 12/i _1 Mpc), the extra power in the redshift distribution is compatible with 
n 4 / 7 /b « 1. We estimate Q 4 / 7 /6 to be 0.53 ±0.15, 1.10 ±0.16 and 0.84 ±0.45 for the CfA, SSRS and IRAS 

m : 

. catalogs. 

■ Higher order correlations £3 and £4 suffer similar redshift distortions, but in such a way that, within the 

amplitudes S3 and S4 are constant as a function of scale between 1-12 h Mpc and have similar values in 
all samples and catalogues, S3 « 2 and 54 « 6, despite the fact that £2, £3; and £4 differ from one sample 
k> ■ to another by large factors (up to a factor of 4 in £2, 8 for £3, and 12 for £4). 

^ ■ The agreement between the independent estimations of S3 and S4 is remarkable given the different 

criteria in the selection of galaxies and also the difference in the resulting range of densities, luminosities 
and locations between samples. 
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1 Introduction 



Measurements of the galaxy 2-, 3-, and 4-point correlation functions, £2, £3, and £4, indicate that a 
hierarchical clustering structure, with £j = Sj^2~ l , holds both in angular catalogs (Groth & Peebles 
1977; Fry & Peebles 1978; Szapudi, Boschan, & Szalay 1992; Meiksin, Szapudi, & Szalay 1992) and in 
redshift catalogs (Bouchet, Davis, & Strauss 1993; Gaztahaga 1992), and both for IRAS and optically 
selected galaxies. The values obtained for the hierarchical amplitudes Sj do not completely agree between 
different analyses (see Table 8 below), and the question arises whether this is caused by redshift distortions 
or by differences in the techniques and selections employed, or arises from intrinsic differences in the 
samples. 

Previous analyses of galaxy correlation functions have focused either on the projected distribution 
inferred from angular data or on the redshift distribution, but there are very few cases where both angular 
and redshift correlations have been estimated for the same sample (one notable exception for the 2-point 
correlation function is Davis & Peebles 1983). Deriving statistical properties from angular data requires 
many galaxies to compensate for the decrease in the projected amplitude, and introduces uncertainties 
from the selection function. With the addition of redshift information, the selection function is better 
known, but the total number of galaxies in redshift samples is usually orders of magnitude smaller because 
of the larger observational time required to take a single redshift. As a result, it is difficult to perform both 
analyses for a single catalog. In particular, for the 3- and 4-point correlation functions we do not know of 
any previous reference where both redshift and angular analyses are considered over identical data. 

Statistics of galaxy clustering inferred from redshift data are likely to be distorted radially by peculiar 
velocities. One known effect is the "fingers of God" : the large velocities in a tightly bound cluster disperse 
the apparent positions of galaxies along the line of sight, such that in the extreme, clusters appear as 
narrow needles pointed back at the observer. Since typical peculiar velocities are a few hundred kilometers 
per second, one might hope that by avoiding clusters or by looking on larger scales this effect could 
be rendered unimportant. However, fluctuations are also smaller on large scales and there are coherent 
streaming effects. Kaiser (1987) showed in perturbation theory that on large scales velocity distortions 
are of the same order as density fluctuations and distort large scale perturbation amplitudes by a factor 
[1 + f(Q) cos 2 6], where 9 is the angle between the perturbation wave vector and the line of sight and 
/(fi) ~ r^ 4 / 7 . Averaged over a spherical volume, fluctuations in redshift space are thus enhanced by a 
factor [1 + f(Q)/3]. Thus, we expect possible redshift distortions on all scales. 

The effect on higher order moments is less clear. Naively, the Kaiser analysis would seem to indicate 
that the J-point hierarchical redshift amplitudes Sj should be smaller than the real amplitudes Sj, by 
a factor Sj/Sj ~ [1 + f(Q)/3] 1 ~ J , and therefore be sensitive to However, extending the analysis to 
second order perturbation theory, required to compute the 3-point function consistently, Bouchet et al. 
(1992) claim that the net effect of redshift distortions on S3 is small. Numerically, Lahav et al. (1993) 
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find in simulations on smaller scales that S3 is quite distorted in redshift space, while Coles et al. (1993) 
do not. Further complicating the theoretical expectations is the possibility of biasing, i.e. whether or not 
galaxies faithfully trace the matter distribution. For large scales, Fry and Gaztahaga (1993) have shown 
that a local biasing preserves the hierarchical structure to all orders but can change the values of the Sj. 
Kaiser's effect, arising from the peculiar velocity modulation of the distribution, applies equally to the 
galaxy number distribution, even when galaxies are not unbiased tracers of the mass, and in general all 
these effects are mixed at each order. 

To separate redshift distortions of observed correlations from other sources of difference, we believe 
that it is important to perform a dual analysis to the same data samples. Thus, in this paper we present 
an analysis in which identical techniques of estimation are applied over the same data samples to compute 
volume average correlations in redshift space for volumes that are expected to exhibit redshift distortions, 
and for configurations that are expected to minimal distortions. In § 2 we discuss measuring correlations 
from moments of counts in cells, including an analytic extension of the count-in-cell tails for finite volume 
samples. In § 3, we relate volume average parameters to intrinsic parameters for spherical and conical 
cells for power-law underlying correlations. Spherical cells are expected to experience redshift distortions, 
which should be minimal in conical cells. § 4 presents details of the samples we use and the data analysis. 
§ 5 contains a summary of results and a final discussion. 

2 Estimation of volume-average correlations 

We center our analysis on the the volume-average correlation functions, 

^ { y) = yj J v d 3 r 1 ...d 3 rjO(r 1 ,...,rj). (1) 

We use volume-limited samples so that no selection function appears. We assume that every region in 
redshift space is equally weighted and faithfully traced by the galaxies in each sample. 

We use the count-in-cell probabilities P n (V) to compute moments and obtain the £j(V). The correlation 
functions £j(V) can be found from the count-in-cell probabilities P n (V), the probability to find n galaxies 
in a randomly selected cell of volume V, from moments ((An)" 7 ). Including discreteness contributions (cf. 
Peebles 1980; Fry 1985), the first few moments are related to the £j(V) as 

(n) = N 

((An) 2 ) = N 2 l 2 + N 

((An) 3 ) = N 3 & + 3N 2 & + N 

((An) 4 ) -3 ((An) 2 ) 2 = N% + 6N 3 & + 7N 2 & + N (2) 
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2.1 Count-in-cell probabilities 

We estimate P n (V) in the following way: given the radius of a test-sphere, we choose randomly a center 
inside the survey sample, count the number of galaxies found inside the cell, and accumulate the number 
of cells N n with n = 0, 1, 2, 3... galaxies. We repeat this procedure Nt times, to be chosen so that 
the sampling spheres will overlap with each other. Counts-in-cell probabilities are then estimated by 
P n = N n /NT- The precision of this estimation is limited by the number of independent cells; from the 
Poisson distribution, we expect AP n /P n > (P n A^)~ 1//2 for P n -C 1. We have to choose Nt in order 
to estimate counts in cells with the right accuracy. A better choice is Nt = £ 2 Vr/V (Gaztahaga & 
Yokoyama 1993), but because £2 is not known a priori we must iterate the process to obtain Nt, using as 
a first approximation the value of £2 estimated from pair-counts. The results do not differ much between 
these two prescriptions, and therefore we have chosen the one that is more simple, i.e. Nt = 2Vt/V. 
Error bars for the P n are estimated from 90% confidence in different realizations of the positions of the 
independent cells, which agree well with the binomial estimate of errors above. 

2.2 Estimation of moments 

Once we have the P n (V) , we next calculate moments of different orders to estimate the correlation functions 
from equation (]2|). This brings us to the finite sample problem. The number of independent cells Nt used 
to estimate P n (V) is necessarily finite, because the size of the sample is finite. This limits the smallest 
value of P n that can be estimated, Pmm(V c ) ~ 1/Nt- There will also be a maximum cell count, n max - The 
probabilities P n for counts n > n max and with P n < P m i n can not be estimated, and thus the moments, 
nominally given by 

00 

(n m ) = Y,n m P n , (3) 

n=0 

can only be summed up to n = n max . An estimate of when this is an important effect is whether n max is 
small relative to N c = N^2, the number of galaxies in a "typical" cluster. If n max < N c , then the moments 
will be systematically underestimated, by an increasing amount for higher moments. When only a few rich 
clusters such as Virgo or Coma are present in a catalog (such as the CfA) there may also be a "bump" 
in the tail of P n . This "bump" may introduce a large artificial enhancement of the correlation functions. 
A further related problem is interaction with the boundary: when cell locations are restricted to avoid 
intersection with the boundaries, the size of the survey area covered gets smaller as the size of the cell 
is increased. This can introduce additional bias, as the center area of the sample is more weighted than 
the boundary, and, moreover, reduces even further the number of independent cells, so that P m i n is larger 
and n max is smaller, which again eventually results in underestimated moments. When the sample is large 
enough that finite sample size is not a problem, the P n appear to fall exponentially with n. This suggests 
a method to correct for the limitations of finite volume. 
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Modeling the tail, that is, extrapolating the P n to n > n max , can help to compensate for the underes- 
timation, or, at the least, can be used to study systematic errors in the moments. Both observations and 
various scale-invariant models suggest that the probability tails are exponential, behaving as P n oc e~ n / Nc , 
where N c = Thus, when n max > N c , the contribution to ^n m P n from n max to oo is small when 

compared with the whole sum in equation (||), as claimed above. Modeling also reduces the effects on the 
correlations that appears when rich clusters such as Coma are present. By fitting an exponential to the 
tail, the effect of a "bump" in the P n is diluted. 

We fit the tail of the observed counts-in-cells to an exponential function, (N n ) = \i n = Ce~ an , using 
a Poisson likelihood function, 



(4) 



. ly n- 

where N n are the observed counts (the number of cells with n galaxies) and /x n = Ce~ an . The values of 
C and a are found by maximization of the likelihood C. As a test we can verify that the observed counts 
in the tail follow a Poisson distribution around the exponential. In practice we construct a routine to 
find the value m where the "tail" begins, i.e. the value of % where the counts begin to fit an exponential 
(ideally, N <C n\ <C n max ), fit \i n = Ce~ an for n\ < n < n max , and extrapolate P n ~ \i n for n > ra max . We 
then must renormalize the distribution of counts so that Pn = 1- For small cell volume, the resulting 
correction to the moments is very small, as n max is already quite large, while for large cells, both the 
contribution from and the uncertainties in this modeling could be very large. To estimate the systematic 
errors introduced in the moments by modeling the exponential tail, we report in the results the difference 
between the "raw" moments (without modeling of the tails) and the modeled moments, 



Y,n m P n -J2 nmp n 



(5) 



A(n m )= 

n=0 n=0 

This provides a natural way to measure when the finite volume effects are large. 

3 Volume average correlations 

3.1 Redshift vs. real space correlations 

The choice of volume shape can emphasize or minimize the effects of peculiar velocities. As one choice, 
we compute moments over spheres of radius R in redshift space. As remarked above, the contribution of 
peculiar velocities to the apparent redshift distance are expected to distort the resulting statistics. We refer 
to the results from spherical volumes as "redshift" results. To extract information about correlations from 
a redshift catalog that is not distorted by the peculiar velocity modulation of the distribution, we simply 
change the shape of the volume, to cells with the shape of conic sectors of a sphere between radii d < r < D, 
of opening angle 6 around a radial axis (in our analysis, d is a fixed to 2500 km s -1 and D is the depth of 
the sample; see Table 1). The estimated counts-in-cells for these volumes are not distorted except perhaps 
at the inner and outer radial boundaries of the cell, where peculiar velocities can move a few galaxies in or 
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out of the cell, a small effect for deep samples. Because results from these conic volumes are not distorted 
by peculiar velocities, we refer to them in brief as "real space" results. As will become apparent below, the 
conic cell analysis is similar to considering just the projected, magnitude-limited angular distribution, with 
integrals similar to those in the standard angular analysis. However, since redshift information is used to 
determine the distance and absolute magnitudes of the galaxies in our samples, the resulting analog of the 
Limber equation (see Peebles 1980) for these volume-limited samples does not involve an integral over the 
luminosity function, thus removing a source of uncertainty. 

3.2 Power-law correlations 

In extracting the underlying correlation functions from the volume-averaged results, we assume a power- 
law two-point correlation function, €2(1") = (^o/ r ) 7 ; and a hierarchical three-point function, as suggested 
by our data (see Figs. la-c). The power law model ^(r) = (ro/r) 7 is not a good approximation for all 
possible ranges of scales (see i.e. Groth & Peebles 1977; Maddox et al. 1990) but, given the resolution in our 
analysis, it is quite a good one for the range of scales we have been able to inspect; i.e. R ~ 1-20 h^ 1 Mpc. 
The volume- average two-point correlation is 

UV) = ^J v d 3 nd 3 r 2 &(|ri - r 2 \). (6) 

For £2(0 = (?"o/ r ) 7 and spherical cells of radius R, we have 

& = S( 7 )(roAR) 7 

S{7) = (l- 7 /3)(l-7/4)(l-7/6)' (7) 
so that ^{R) is also a power law, with, in the absence of peculiar velocity distortions, the same slope as 
£2(7*) and with amplitude different by a factor 5(7) of order unity. For 7 = 2, S^t) = 9/4; for 7 = 1.5, 
S« 1.5. 

For a volume shaped as conic sector of an sphere of opening angle 9 around a radial axis between radii 
d < r < D we find for power-law £2(0 in the limit 8 <C 1, 

I2 = C(7)^ 7 
C7(7) = h Hl fr ° V 



II 2tt \D 



f 

J — < 



dx r(i/2)r( 7 /2 - 1/2) 



(1 + x 2 ) 7 / 2 r( 7 /2) 

F{z) = xdx d(f>{z 2 + x 2 -2zx cos 0)( 1 " 7 )/ 2 (8) 
Jo Jo 

so that £2 i s a l so a power law but with a different slope and (possibly greatly) reduced amplitude, ^i^) ~ 
£(D) 6 l ~"< . In each case, from the fitted slope and amplitude of observed £2 and from the expressions above, 
one can estimate the values of ro and 7 for the intrinsic two-point correlation function. 
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For higher order correlations we assume hierarchical properties, 6 = Sj^' 1 , again in agreement with 
our data (see Figs. 2-3) and previous analyses. For the 3-point correlation function we use the local 
hierarchical expression 

£3(1*1, r-2, r 3 ) = Q 3 [6(^12)6(^13) +6(^12)6(^23) +6(^13)6(^23)] (9) 
to compute the volume average 

600 = ^ ^rf 3 r 1 d 3 r 2 d 3 r 3 6(n,r 2 ,r 3 ). (10) 

We are interested in the hierarchical amplitudes, i.e. we want to estimate: S3 = 6(^0/£2 (^0- F° r a 
spherical volume, the hierarchical relation (0) with 6( r ) = ( r o/ r ) 7 implies: 



S§ = 3^(7)^3 

(6 - 7) 2 46 - 487 + 177^ - 2 7 a 

•nil) - 



3(2-7)2 



2(7-2 7 )(9- 2 7 ) 



(11) 



2 2 ^(2- 7 )r(5- 7 ) 2 2 ^r(5- 7 ) 
256r(9/2 - 7 ) 256r(ll/2- 7 ) 

(Gaztahaga & Yokoyama 1993). For < 7 < 2, S3 and 3Q3 differ by less than 3%, so that in practice one 
can use as a good approximation Sf ~ 3Q3. In our analysis, averages over spherical volumes and thus S3 
are obtained in redshift space. 

For the conic cell shape we have, in the limit of small 9, 

T N-2 t 

b N = A Q N (12) 

where In is defined in equation (|8|) above. Thus, by measuring S3 7 and 7 for conic cells one can use the 
above relations to estimate the corresponding Q3 or Sf in real space. 

4 Data analysis results 
4.1 Catalogues 

We use three different catalogs of galaxies: 

• i) the North Zwicky Center for Astrophysics catalog (Huchra et al. 1983, hereafter CfA), with mg< 
14.5, 5 > 0, and b > 40°, which has a solid angle of 1.83 sr. 

• ii) the Southern Sky Redshift Survey (Da Costa et al. 1991, hereafter SSRS) of diameter-selected 
galaxies with logd(0) > 0.1 from the ESO catalog with 5 < —17.5°, b 11 < —30°; the solid angle is 
1.75 sr. 
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Hi) the full-sky coverage redshift survey of galaxies from the IRAS (Infrared Astronomical Satellite) 
database (Strauss et al. 1992, hereafter IRAS) with limiting flux density of 1.936 Jy at 60 //m and 
galactic latitude \b\ > 5°; the solid angle is 11.0 sr. 



We select four different volume- limited subsamples from each catalog, as shown in Tables 1-3, where 
zn m is the maximum redshift, A^ot is the total number of galaxies, Vp is the volume of each sample, and Mb, 
dii m , and z 2 feo are the limiting absolute magnitude, "face-on" diameter, and flux, respectively. Lengths in 
the tables quoted as "Mpc" use Hq = 100. To produce fair volume-limited samples we have not included 
the fainter or smaller galaxies, so that the space in each sample is homogeneously filled. For example, 
each CfA sample includes galaxies brighter than Mb = rriB — 25 — 5 log(zii m /i?o), where tub = 14.5 is 
the limiting apparent magnitude. In addition, galaxies in the CfAN50 sample are restricted to be fainter 
than Mb = —20.3, so that this sample is independent of CfAN90. In the SSRS catalogue, each sample 
includes galaxies with physical diameter greater than dn m = zn m 9 cu t / HQ,where 9 cn t = 1'.26 is the "face-on" 
diameter cut-off. In the IRAS catalogue, each sample includes galaxies with absolute flux z 2 /qq greater 
than the limiting flux z 2 im 1.936 Jy. 

Heliocentric redshifts are corrected only for our motion with respect to the rest frame of the Cosmic 
Microwave Background; v = 365kms _1 toward (a, S) = (11. 2 h ,— 7°) (Smoot et al. 1991). Galaxies with 
redshifts smaller than 2500kms _1 are excluded, since at these scales typical peculiar velocity flows (of 
several hundreds of km/s) can compete with the general recession velocity. 

4.2 Count-in-cell volumes 

For each sample, we want to estimate the count-in-cell probabilities P n as a function of the cell volume. 
The total number of degrees of freedom is limited by the density n of the sample and its total volume Vt, 
so that the size of the cell is bounded bellow by n and above by Vt- The number of different cell sizes 
and their intervals should not be set arbitrarily, as the resulting moments or correlations at each point 
would not be independent. For samples of small density, a good criterion for statistically independent 
sizes is Vi = i/n, for i = 1,2, . . ., as then each new volume includes on average one more galaxy. Notice 
how different this is from using equal spacing in the radius of spherical cells for all samples; this not only 
introduces a different weighting but also imposes an arbitrary number of degrees of freedom, regardless 
of the size or density of the sample. In our analysis we use equal volume spacing in units of V\ = 1/n. 
We use half this unit to study the fluctuations or when we have only few points. The largest cell to be 
used is limited by the boundaries of the sample. Using different boundary conditions, Gaztahaga (1992) 
and Gaztahaga &: Yokoyama (1993) have found that a good phenomenological bound is V < Vr/3 3 , so 
that the diameter of the cell is at most one third of the typical size of the sample. This is just one upper 
limit; we also discard the larger cells of our analysis when the estimated uncertainties, from equation (||) 
are too large. The resulting range of scales displayed in Tables 4-6 is different for each sample. The larger 
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samples can accommodate larger cells, but also have smaller densities and thus do not have information 
on the smaller scales. 



4.3 Correlation functions 

We estimate the correlation functions using the moments of counts- in-cells as explained in section § 2. 
Figures la-c show the volume average two-point correlation function, £2, as a function of the size of the 
cell. Open squares correspond to spherical ( "redshift" ) cells and triangles correspond to conic ( "real space" ) 
cells. The scale of the cell, given in Mpc with Hq = 100, corresponds to the radius R of the redshift-space 
sphere or the radius of the base of the conical sector, 9D with D the depth of the sample. The error of each 
point is the maximum between the systematic error (q) from correcting the count distribution tail for finite 
volume and the statistical error from the P n propagated to the moments. The dashed line in each figure 
is the best fit to a power law, weighted by the errors. The agreement with a power law model justifies our 
assumption of the relations in section § 3. The slopes for the conic cells (triangles) and spherical shells 
(squares) follow what we expect from the projection, ^ ~ O 1 ^ 1 D^ 1 and £f ~ Rr" 1 . In Tables 4-6 we 
display the values of the fit in terms of the slope 7 and correlation length ro of the intrinsic correlation 
function £,2(1") = ( r o/ r ) 7 using equation (|7|) and equation (||). Note that the range of scales where the fit 
is done is different for each sample, as explained above. We quote both the statistic and the systematic 
error (the last in parenthesis). The errors in the fitted parameters, ro and 7, reflect both the goodness of 
the fit and the statistic (or systematic) error in the estimated correlations; both contributions are usually 
of comparable magnitude. 

Figures 2a-c show ^(R) (triangles) and £±(R) (squares) for spherical (redshift) cells, plotted against 
^2(R) f° r ah samples. In Figures 3a-c we present a similar plot for conical (real space) cells. Again, the 
error plotted at each point is the maximum between the systematic error (eq. ||) and the statistical error 
from the P n propagated to the moments. The dashed line is the best fit to £3 = S3 £| an d £4 = S4 £2 
weighted by the errors. The agreement of these power law models is evidence in favor of the hierarchical 
model and, again, justifies our assumptions in section § 3. 

Tables 4-6 shows the parameters £3 and S4 of the fit. The values of S3 for conical cells have been 



scaled to the corresponding value of a spherical cell using equation (11) and equation (|12|) , so that real 
space and redshift space amplitudes can be directly compared. The correction to convert S3 into S3 is 
smaller than 15% so that we have not propagated the uncertainties from 7 in this conversion. Because this 
correction turns out to be small and there are large uncertainties in S4 we have not tried to calculate the 
exact correction in the case of the 4-point function, but have just scaled the correction found for S3 using 
the dimensional argument S4/S4 ~ [S3 /S3'] 2 . We quote both the statistic and the systematic error (the 
last in parenthesis) . The errors in the fitted parameters S3 and S4 reflect both the goodness of the fit and 
the statistic (systematic) error in the estimated correlations; both contributions are usually of comparable 
magnitude. Samples IRAS65 and IRAS80 have a very noisy signal for spherical cells and results are not 
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quoted. 

The average values reported in Tables 4-6 are the mean values for all samples within a catalog, weighted 
by l/of (using the maximum between the statistic and systematic errors). The first error in the average is 
the weighted dispersion about the mean, scaled to a 1-a interval after accounting for the difference between 
the Gaussian and Student-i distribution. For most averages, with four degrees of freedom, the dispersion 
is increased by the factor 1.14163. In parentheses we also quote the expected error, 1/ XXV°f)- These 
two are the same when (reduced) x 2 = 1. 

4.4 Redshift distortions 

In the linear regime Kaiser (1987) has derived the expected ratio of two-point function in redshift space, 
£2(3), to that in real space, ^( r ) : 

where b is the bias factor between matter fluctuations and galaxy fluctuations, 5 g = b5 m . Lilje & Efstathiou 
(1989) have shown that these redshift distortions are expected even on mildly non-linear scales where 
£2(0 ~ 1- Non-linear effects and the "fingers of God" for smaller scales tend to suppress the redshift 
correlations, so that £2(3) /fair) becomes smaller than 1 for small separations (Davis &z Peebles 1983). 
Therefore, even if we have not reached the scales where the linear theory should apply (whatever this scale 
is), we should be able to place a lower bound on J7 4//7 /6 from the maximum redshift distortion we are able 
to detect. 

As will become apparent, we can not use our data to make an accurate determination of this effect, but 
we can estimate the order of magnitude. At some point the power laws we have observed for the real and 
redshift two-point correlations will break to reproduce Kaiser's results, a constant ratio of £2(5) to £2(1"). 
The ratio we see on larger scales is not constant, but rising with scale, except in one case. Even though in 
our analysis this ration has not reached a constant value, presumably because of poor sensitivity at large 
scales, we can use the extra power observed in the redshift correlation to place a lower bound on the factor 
n^ 7 /b, and, if fl = 1, an upper bound on b. Using the values of ro, sq and 7 shown in Tables 4-6, we 
estimate the ratio £2(5) / £2(1") at ro and at the scale r max corresponding to the largest spherical (redshift) 
cell in each sample. Table 7 shows r max , the value of the intrinsic two-point (real-space) correlation, the 
correlation strength at that scale, ^( r max)) an d the resulting values of S~2 4y/7 /6 resulting from equation (13). 



As argued above, the ratio £,2(s) / £2(1") increases towards larger scales, and thus the final column represents 
our strongest lower bound on f2 4 / 7 /6. Errors in fi 4 / 7 /6 are propagated from the errors in ro, so, and 7 
(again, the maximum between the statistic and systematic errors). The average values in Table 7 are the 
mean values for all samples within a catalog, weighted by the errors. The negative values of S~2 4//7 /6 for 
CfAN65 do not affect the average value much, but do increase the estimated dispersion by a factor of 2. 
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5 Discussion 



In this work, we have computed volume-averaged correlation functions £2 , £3 and £4 for conical and spherical 
configurations that allow us to observe the effects of peculiar velocity distortions. We have considered 
volume-limited samples so that no assumption has to be made about a luminosity function. We make 
use all the degrees of freedom in each sample and weight them properly. To account for boundary and 
finite size effects we have introduced a modeling of the tail of the distribution of fluctuations which helps 
to improve the signal and estimate the uncertainties. We report the difference between results obtained 
with and without this modeling of the tail as a systematic error, which we have taken as the minimum 
uncertainty in our results, a conservative approach that leaves our conclusions safely unaffected by our 
assumptions. To simplify comparisons of the analyses of different samples we have reduced the observed 
correlation functions, ^(R), £,3(R), and d(R) in Figures 1-3, to a few parameters shown in Tables 4-6. 

Our analysis reproduces the already known trend (Davis et al. 1988) that the amplitude sq of the 
two-point redshift correlation function of optical galaxies grows with the depth of the survey, at least up to 
a redshift of 8000 km s -1 . This trend is confirmed here for both the Southern and Northern galactic hemi- 
spheres for optical selected galaxies, i.e. in both the SSRS and CfA catalogs; within our poor resolution, it 
is not observed in the IRAS samples. As a new result, we find that this scaling with depth seems to occur 
also in real space, which indicates that it can not just be a consequence of redshift distortions. We also 
find that for depths larger than 8000 km s -1 the trend is broken: the observed correlation length decreases 
in the 9000kms _1 samples. This indicates that the effect may be accidental or caused by sampling an 
"unfair" local volume, but does not imply a simple "fractal" or inhomogeneous behavior. 

For the average values and even for separate subsamples within each catalog, the intrinsic 2-point 
correlation length tq of the CfA and SSRS are very similar. The relative amplitude of the IRAS and 
CfA two-point correlations is in very good agreement with other analyses (Strauss et al. 1992; Saunders, 
Rowan-Robinson, & Lawrence 1992) although the techniques and samples used are quite different. There 
is a significant difference between the average slope 7 of £2 ~ r_7 in real space between the SSRS and 
the CfA or IRAS catalogues. The slope in the SSRS, 7 = 1.62 ± 0.04, is lower than the "standard" value 
of 7 = 1.8, but the errors are quite large. SSRS diameter magnitudes are more subject to plate-to-plate 
correction which might introduce some large scale power. There is also a very large void in this catalog 
(larger than any in the CfAl). 

In general, there is a characteristic redshift distortion on £2( r ) : the slope 7 is flatter and the correlation 
length is larger in redshift space than in real space, that is, redshift distortions "move" correlations from 
small to large scales. In particular, for the average values, going from real to redshift space reduces the 
slope of £2 by 30% (SSRS), 20% (IRAS) and 13% (CfA). At the largest scales the extra power in the 
redshift distribution is compatible with a f2 4/,7 /6 ~ 1, as seen in Table 7. As expected, the average values 
of S~2 4 / 7 /6 for each catalog are correlated with the amount of reduction in the redshift slope. Our results 
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for n^ 7 /b in the CfA are much larger than those deduced by Lilje h Efstathiou (1989) from Figure 3 
of Davis & Peebles (1983), perhaps because they only use scales up to 7ft, -1 Mpc, whereas ours reach to 
12 ft" 1 Mpc. 

The higher order correlations £3 and £4 are redshift distorted also, but in such a way that the hierarchical 
amplitudes 53 and £4 are insensitive to these distortions, within the resolution of our analysis. This is in 
agreement with the numerical studies of Coles et al. (1993). It follows that redshift distortions are probably 
not responsible for the discrepancies in the values of S3 and S4 found in some of the previous analyses, 
summarized in Table 8. Moreover, the hierarchical amplitudes S3 and S4 are constant as a function of scale 
between 1-20 h^ 1 Mpc (which includes the transition between linear and non-linear scales), and they have 
a similar value in all samples, even for different catalogues, despite that £2, I3 and £4 differ from one sample 
to another by large factors (up to factors of 4 in £2, or 8 for £3, or 12 for £4). This is remarkable given 
the different criteria in the selection of galaxies and also the difference in the resulting range of densities, 
luminosities and locations between samples. 
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TABLE 1 
CfA Samples 

Sample z hm (km/s) M B A" gal V T (Mpc 3 ) 

CfAN50 5000 [-20.3,-19.0] 206 6.7 x 10 4 

CfAN65 6500 < -19.5 208 1.6 x 10 5 

CfAN80 8000 < -20 207 3.0 x 10 5 

CfAN90 9000 < -20.3 146 4.4 x 10 5 



TABLE 2 
SSRS Samples 

Sample z hm (km/s) d Um (kpc) 7V gal V T (Mpc 3 ) 

SSRS50 5000 18.3-33 222 6.7 x 10 4 

SSRS65 6500 > 22 283 1.5 x 10 5 

SSRS80 8000 > 29.3 220 2.9 x 10 5 

SSRS90 9000 > 33 203 4.2 x 10 5 



TABLE 3 
IRAS Samples 

Sample z lim (km/s) z 2 / 60 (Jy Mpc 2 ) 7V gal V T (Mpc 3 ) 

IRAS45 4500 > 3920 319 3.3 x 10 5 

IRAS55 5500 > 5856 404 6.1 x 10 5 

IRAS65 6500 > 8180 359 1.0 x 10 6 

IRAS80 8000 > 12390 314 1.9 x 10 6 
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TABLE 4a 
CfA Results, Real Space 



Sample 


6zu m 


(Mpc) 


7 


r (Mpc) 


S3 


5 4 


CfAN50. . . . 


1 


-6 


1.88 ±0.03 (0.05) 


3.68 ±0.11 (0.21) 


1.98 ±0.22 (0.70) 


6.0 ±3.6 (4.1) 


CfAN65.... 


1 


-8 


1.82 ± 0.02 (0.04) 


4.98 ±0.10 (0.17) 


1.87 ±0.13 (0.52) 


5.6 ±4.8 (3.1) 


CfAN80.... 


2^ 


10 


1.76 ±0.05 (0.09) 


7.09 ± 0.53 (0.88) 


1.91 ±0.18 (0.50) 


5.4 ±2.2 (1.3) 


CfAN90.... 


3- 


-14 


2.26 ±0.13 (0.12) 


5.75 ± 0.70 (0.64) 


2.03 ± 0.45 (0.43) 


9.8 ±3.7 (3.9) 


Average .... 


1- 


-14 


1.86 ± 0.07 (0.03) 


4.56 ±0.50 (0.13) 


1.95 ±0.04 (0.26) 


6.3 ±1.1 (1.6) 



TABLE 46 

CfA Results, Redshift Space 



Sample 


R (Mpc) 


7 


so (Mpc) 


S 3 


S 4 


CfAN50.... 


2-7 


1.62 ±0.08 (0.10) 


4.47 ±0.27 (0.33) 


1.77 ±0.13 (0.33) 


4.9 ±0.9 (1.7) 


CfAN65.... 


3-8 


1.89 ±0.23 (0.36) 


4.81 ±0.86 (1.33) 


1.57 ±0.16 (0.53) 


2.9 ±1.0 (2.7) 


CfAN80.... 


4-13 


1.56 ±0.06 (0.10) 


8.13 ±0.64 (1.06) 


2.34 ±0.14 (0.62) 


7.8 ± 1.0 (3.5) 


CfAN90.... 


8-16 


1.82 ±0.22 (0.45) 


6.83 ± 1.61 (3.31) 


2.57 ±0.29 (0.85) 


10.2 ±2.4 (6.1) 


Average .... 


2-16 


1.61 ±0.05 (0.07) 


4.81 ±0.67 (0.31) 


1.88 ±0.31 (0.24) 


5.1 ±1.2 (1.3) 
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TABLE 5a 
SSRS Results, Real Space 



Sample 


9z lim (Mpc) 


7 


r (Mpc) 


S3 


5 4 


SSRS50 


1-6 


1.64 ±0.02 (0.03) 


3.60 ± 0.33 (0.45) 


1.72 ±0.14 (0.30) 


4.8 ±4.7 (1.6) 


SSRS65 


1-8 


1.57 ±0.01 (0.03) 


5.27 ±0.14 (0.47) 


2.09 ±0.15 (0.82) 


6.2 ±4.6 (4.4) 


SSRS80 .... 


2-7 


1.64 ±0.09 (0.10) 


5.69 ±0.57 (0.69) 


1.77 ±0.18 (0.23) 


5.0 ±3.6 (1.8) 


SSRS90 .... 


2-12 


1.71 ±0.04 (0.06) 


4.80 ±0.24 (0.36) 


1.78 ± 0.48 (0.93) 


6.3 ±2.3 (5.7) 


Average .... 


1-12 


1.62 ±0.03 (0.02) 


4.70 ±0.46 (0.23) 


1.77 ±0.05 (0.17) 


5.4 ±0.4 (2.2) 



TABLE 56 
SSRS Results, Redshift Space 



Sample 


R (Mpc) 


7 


so (Mpc) 


S 3 


S 4 


SSRS50 


2-6 


1.32 ±0.08 (0.11) 


4.79 ±0.35 (0.51) 


1.65 ±0.10 (0.33) 


4.0 ±0.5 (1.8) 


SSRS65 


3-9 


1.05 ±0.02 (0.05) 


9.16 ±0.32 (0.77) 


1.97 ±0.06 (0.48) 


5.5 ±0.4 (2.2) 


SSRS80 .... 


4-10 


1.30 ±0.11 (0.13) 


9.15 ± 1.41 (1.71) 


1.97 ±0.11 (0.35) 


5.6 ±0.8 (1.7) 


SSRS90 .... 


5-11 


1.25 ± 0.22 (0.25) 


8.15 ±2.49 (2.85) 


1.83 ±0.17 (0.37) 


5.3 ± 1.4 (2.2) 


Average .... 


2-11 


1.12 ±0.08 (0.04) 


6.33 ±1.36 (0.41) 


1.83 ±0.09 (0.19) 


5.1 ±0.5 (1.0) 
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TABLE 6a 
IRAS Results, Real Space 



Sample 




(Mpc) 


7 


r (Mpc) 


S3 


5 4 


IRAS45 .... 


2 


-13 


1.89 ± 0.02 (0.04) 


3.85 ±0.15 (0.27) 


2.06 ± 0.25 (0.38) 


7.4 ±6.1 (2.9) 


IRAS55 .... 


2- 


-17 


1.75 ±0.02 (0.03) 


4.04 ±0.10 (0.18) 


2.44 ± 0.52 (0.48) 


9.4 ±5.9 (4.0) 


IRAS65 .... 


3- 


-20 


1.76 ±0.11 (0.15) 


3.80 ± 0.44 (0.60) 


2.33 ± 1.35 (0.69) 


11.7 ±9.6 (9.2) 


IRAS80 .... 


4- 


-16 


1.87 ±0.21 (0.32) 


4.00 ±1.23 (1.14) 


3.41 ±2.11 (1.22) 


24 ± 34 (23) 


Average .... 


2- 


-20 


1.80 ± 0.04 (0.02) 


3.97 ±0.06 (0.14) 


2.22 ±0.16 (0.30) 


9.2 ± 1.5 (3.9) 



TABLE 6b 
IRAS Results, Redshift Space 



Sample 


R (Mpc) 


7 


so (Mpc) 


S3 


S 4 


IRAS45 .... 


3-9 


1.36 ±0.23 (0.23) 


5.37 ± 1.23 (1.20) 


2.28 ±0.41 (0.25) 


8.8 ±3.4 (2.2) 


IRAS55 .... 


4-10 


1.50 ±0.17 (0.15) 


4.65 ± 0.90 (0.79) 


1.93 ±0.21 (0.33) 


6.6 ± 1.5 (2.8) 


Average .... 


3-10 


1.44 ±0.09 (0.14) 


4.90 ± 0.45 (0.73) 


2.07 ±0.23 (0.26) 


7.5 ±1.4 (2.1) 
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TABLE 7 

Results for ^ jb 

Sample r max (Mpc) &(r max ) ^ 4/ 7& M ^ 4/7 /Mw) 

CfA50 5.18 0.53 0.48 ± 0.24 0.63 ± 0.26 

CFA65 5.52 0.83 -0.10 ±0.79 -0.16 ±0.78 

CfA80 9.75 0.57 0.33 ± 0.44 0.43 ± 0.47 

CfA90 11.3 0.22 0.48 ± 1.45 0.98 ±1.61 

CfA Average ... . 0.41 ±0.10 0.53 ±0.15 

SSRS50 4.73 0.64 0.58 ± 0.35 0.73 ± 0.41 

SSRS65 7.44 0.58 0.92 ± 0.23 1.24 ±0.31 

SSRS80 7.91 0.58 0.99 ± 0.52 1.19 ±0.57 

SSRS90 8.79 0.36 1.07 ±0.82 1.59 ±0.90 

SSRS Average .. . 0.85 ±0.10 1.10 ±0.16 

IRAS45 7.04 0.32 0.71 ± 0.56 1.27 ±0.64 

IRAS55 7.60 0.33 0.32 ± 0.46 0.57 ±0.50 

IRAS Average . . . 0.48 ± 0.25 0.84 ± 0.45 
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Table 8 
Estimates for S3 and S4 



Reference 


Sample 


S3 or 3Q 3 


S4 or I6Q4 


Groth & Peebles 1977 . . . 


Lick-Zwicky 


3.9 ±0.6 


— 


Fry & Peebles 1978 


Lick-Zwicky 


— 


46 ± 8 


Szapudi et al. 1992 


Lick 


4.3 ±0.2 


31 ± 5 


Peebles 1980 


CfA 


2.4 ±0.2 


— 


This paper 


CfA 


2.0 ±0.3 


6.3 ± 1.6 


This paper 


SSRS 


1.8 ±0.2 


5.4 ±2.2 


Meiksin et al. 1992 


IRAS 


2.2 ±0.2 


10 ± 3 


This paper 


IRAS 


2.2 ±0.3 


9.2 ±3.9 


Gaztahaga 1992 


CfA (redshift) 


1.9 ±0.1 


4.1 ±0.6 


This paper 


CfA (redshift) 


1.9 ±0.2 


5.1 ± 1.3 


Gaztahaga 1992 


SSRS (redshift) 


2.0 ±0.1 


5.0 ±0.9 


This paper 


SSRS (redshift) 


1.8 ±0.2 


5.2 ± 1.3 


Bouchet et al. 1993 


IRAS (redshift) 


1.5 ±0.5 


4.4 ±3.7 


This paper 


IRAS (redshift) 


2.1 ±0.3 


7.5 ±2.1 
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6 Figure Captions 



Fig. 1. — Average correlation function, £2 for each sample in the CfA (a), SSRS (6) and IRAS (c) catalogue. 
Open squares correspond to £2 for spherical (redshift) cells as a function of the radius R of the cell. Filled 
triangles correspond to £2 f° r conical (real space) cells as a function of the radius 9z at the base of the 
cone. The dashed lines are the best power-law fit weighted by the errors. 

Fig. 2. — Values of £3 (triangles) and £4 (squares), as a function of £2 for spherical (redshift) cells. There 
is one graph for each sample in the CfA (a), SSRS (b) and IRAS (c) catalogue. The dashed lines are the 
hierarchical law: £j = Sj^' 1 where S3 and £4 are fitted with the data weighted by the errors. 

Fig. 3. — Values of £3 (triangles) and £4 (squares), as a function of £2 for conical (real space) cells. There 
is one graph for each sample in the CfA (a), SSRS (6), and IRAS (c) catalogue. The dashed lines are the 
hierarchical law: £j = Sjf^ _1 where 53 and £4 are fitted with the data weighted by the errors. 
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